Corpus: eng-ag_web_2017_30K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences of the corpus

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              78            86             88            90            90
1000            741           918            956           966           967
10000          4202          7926           9085          9522          9653
100000         8582         20557          25611         27966         28831
1000000        8582         20557          25611         27966         28831
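
The counts in the table can be approximated with a short script. The following is only a minimal sketch, not the tool that produced this page: the function names are hypothetical, tokenization is assumed to be a plain whitespace split, and sentences shorter than N tokens are assumed to contribute no N-gram. How the original analysis treats punctuation and short sentences is not stated here.

```python
def ending_ngram_counts(sentences, max_n=5):
    """Count distinct word N-grams that end a sentence, for N = 1..max_n.

    Assumption: tokens are separated by whitespace; a sentence with fewer
    than N tokens is skipped for that N.
    """
    seen = {n: set() for n in range(1, max_n + 1)}
    for sentence in sentences:
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                # The last n tokens form the sentence-ending N-gram.
                seen[n].add(tuple(tokens[-n:]))
    return {n: len(ngrams) for n, ngrams in seen.items()}


def counts_for_prefixes(sentences, ks=(100, 1000, 10000), max_n=5):
    """Repeat the count for the first K sentences, for each K in ks."""
    # Slicing past the end of the list simply returns all sentences.
    return {k: ending_ngram_counts(sentences[:k], max_n) for k in ks}
```

Because slicing past the end of the list returns every sentence, any K larger than the corpus yields the same counts. That behaviour would be consistent with the identical rows for K = 100000 and K = 1000000 above, although the page itself does not state the reason.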


Zipf's diagram for sentence endings


Gnuplot diagram
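
The data behind a Zipf diagram is a rank-frequency list, usually drawn on log-log axes. As a rough sketch under stated assumptions (hypothetical function names, whitespace tokenization, and the final word of each sentence as the counted unit, since the page does not say which unit the diagram uses), the points could be computed like this:

```python
from collections import Counter
import math


def zipf_points(sentences):
    """Return (rank, frequency) pairs for sentence-final words.

    Assumption: the final whitespace-separated token of each non-empty
    sentence is the unit being counted.
    """
    freq = Counter(s.split()[-1] for s in sentences if s.split())
    ranked = sorted(freq.values(), reverse=True)
    # Rank 1 is the most frequent sentence ending.
    return list(enumerate(ranked, start=1))


def log_log_points(points):
    """Map (rank, frequency) pairs to log10 coordinates for plotting."""
    return [(math.log10(rank), math.log10(f)) for rank, f in points]
```

If the endings follow Zipf's law, the log-log points fall roughly on a straight line with negative slope.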
